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Production of Complex Human 
Glycoproteins in Yeast 

Stephen R. Hamilton,* Piotr Bobrowicz,* Beata Bobrowicz,* 
Robert C. Davidson/ Huijuan Li,* Teresa Mitchell,* Juergen H. 
Nett,* Sebastian Rausch,* Terrance A. Stadheim,* Harry 
Wischnewski,* Stefan Wildt,* Tillman U. Gerngrossj 

We report the humanization of the glycosylation pathway in the yeast Pichia 
pastoris to secrete a human glycoprotein with uniform complex N- glycosyla- 
tion. The process involved eliminating endogenous yeast glycosylation path- 
ways, while properly localizing five active eukaryotic proteins, including man- 
nosidases I and II, N-acetylglucosaminyl transferases I and II, and uridine 5'- 
diphosphate (UDP)-rV-acetylglucosamine transporter. Targeted localization of 
the enzymes enabled the generation of a synthetic in vivo glycosylation path- 
way, which produced the complex human N-glycan /V-acetylglucosamine 2 - 
mannose 3 ^/V-acetylglucosamine 2 (GlcNAc 2 Man 3 GlcNAc 2 ). The ability to gen- 
erate human glycoproteins with homogeneous rV-glycan structures in a fungal 
host is a step toward producing therapeutic glycoproteins and could become 
a tool for elucidating the structure-function relation of glycoproteins. 



The ability to produce recombinant human 
proteins has led to major advances in human 
health care and remains an active area of drug 
discovery. Some proteins such as insulin, 
which are not glycosylated in their native 
state, do not require glycosylation and can 
thus be expressed in prokaryotic hosts that 
lack a glycosylation machinery, such as 
Escherichia coli. Most therapeutic proteins, 
however, require the cotranslational addition 
of glycans to specific asparagine residues of 
the protein to ensure proper folding and sub- 
sequent stability in the human serum. For 
therapeutic use in humans, glycoproteins re- 
quire human-like N-glycosylation. Mamma- 
lian cell lines that are able to replicate 
human-like glycoprotein processing have sev- 
eral drawbacks including low protein titers, 
long fermentation times, heterogeneous prod- 
ucts, and ongoing viral containment issues. 

Yeast and filamentous fungi are robust 
industrial fermentation organisms that can be 
grown to high cell density in chemically de- 
fined medium. However, glycoproteins de- 
rived from fungal expression systems contain 
nonhuman N-glycans of the high mannose 
type (Fig. 1), which are immunogenic in hu- 
mans and thus of limited therapeutic value 
(7). Nevertheless, fungi and mammals share 
initial steps of protein N-glycosylation, 
which involve the following: (i) the site- 
specific transfer of Glc 3 Man 9 GlcNAc 2 from 
the luminal side of the endoplasmic reticulum 
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(ER) to the de novo synthesized protein by an 
oligosaccharyl transferase complex, (ii) trim- 
ming by glucosidases I and II, and (iii) the 
removal of one specific terminal ot-1,2- 
mannose residue by an ER-residing a- 1,2- 
mannosidase. These sequential steps lead to 
the formation of Man 8 GlcNAc 2 structures 
(isomer Man8B) (Fig. 1), the Af-glycan found 
on most glycoproteins leaving the fungal or 
mammalian ER. After the export of pre- 
dominantly Man 8 GlcNAc 2 -containing gly- 
coproteins to the Golgi, jV*-glycan process- 
ing pathways diverge notably between 
mammals and yeast (2). The human Golgi 
contains several ot-l,2-mannosidases (IA, 
IB, and IC), which remove mannose to 
yield Man 5 GlcNAc 2 , a precursor for com- 
plex Af-glycan formation (Fig. 1). 

In Saccharomyces cerevisiae, N-glycosyl- 
ation involves the addition of numerous man- 
nose sugars throughout the entire Golgi, 
which often leads to hyperrnannosylated N- 
glycans with more than 100 mannose resi- 
dues. When this first a-l,6-mannose is added 
by 1 ,6-mannosyltransferase (Ochlp), addi- 
tional a-l,6-mannosyltransferases extend the 
ot-l,6-chain, which then becomes the sub- 
strate for a- 1,2- and a- 1,3 -mannosyl trans- 
ferases, as well as phospho-mannosyltrans- 
ferases that add yet more mannose sugars to 
the growing JV-glycan structure (3). 

Pichia pastoris is a methyl otrophic yeast 
with a glycosylation machinery that is similar 
to that of S. cerevisiae and is frequently used 
for the expression of heterologous proteins 
(4). YJN201, a recombinant strain of P. pas- 
(oris, lacks endogenous Ochlp activity, and 
contains three heterologous genes encoding 
a- 1 ,2-mannosidase localized to the ER, 
UDP-GlcNAc transporter, and GlcNAc trans- 
ferase I localized to the Golgi (Fig. 1) (5), 



Two of these proteins (MnsI and GnTI) are 
synthetic fusions between fungal type II 
membrane proteins and catalytic domains 
from Caenorhabiditis elegans and humans, 
respectively. YJN201 has the ability to se- 
crete the kringle 3 domain of human plasmin- 
ogen (K3) as a reporter protein, with N- 
glycans of the hybrid type (5). Here, we 
further humanize the N-glycosylation path- 
ways in P. pastoris. 

To identify a compatible combination of Mn- 
sll and GnTII, a two-stage approach was chosen. 
First, a combinatorial library of several MnsI I 
catalytic domains was fused to a signal peptide 
library of more than 60 fungal type II membrane 
localization signals. The resulting 500 combina- 
torial fusion constructs were introduced into the 
P. pastoris strain YSH1, described in the subse- 
quent paragraph, capable of producing the hu- 
man precursor of complex glycosylation, 
GlcNAcMan 5 GlcNAc 2 (Fig. 2B, compare Fig. 
2 A) on the reporter K3. Only a small subset of 
strains (<5%) were capable of quantita- 
tively converting GlcNAcMan 5 GlcNAc 2 to 
GlcNAcMan GlcNAc 2 (Fig. 2C). These strains 
were isolated and subsequently transformed with 
a combinatorial library of several hundred Gn- 
TH-leader peptide fusions. Screening for the 
presence of GlcNAt^Ma^GlcNA^ allowed 
the isolation of strains that were able to 
secrete homogeneous complex glycan (Fig. 
2D), as exemplified by strain YSH44. 

Pichia pastoris YSH44 was engineered 
from BK64-1 (f5), an ochl deletion mutant 
secreting K3, a reporter protein with a single 
N-linked glycosylation site (5). After the in- 
troduction of the Kluyveromyces lac t is uri- 
dine 5 '-diphosphate (UDP)-GlcNAc trans- 
porter (5), the strain was further used for the 
introduction and screening for active a- 1 ,2- 
mannosidase fusions. Among 608 chimera, 
the mouse MnsIA catalytic domain fused to 
the N-terminal localization peptide of the ER 
protein Sec 12 from S. cerevisiae was selected 
because of its ability to produce primarily 
Man 5 GlcNAc 2 . Subsequent introduction of a 
human GnTI fusion library allowed the iso- 
lation of a strain (PBP-3) able to produce 
primarily GlcNAcMan 5 GlcNAc 2 structures 
(Fig. 2B). The URA marker in PBP-3 was 
recovered, which produced the strain YSH1. 
Transformation of the latter strain with a 
library of MnsII catalytic domains fused to a 
library of localization peptides permitted 
the isolation of yeast strains producing 
GlcNAcMan 3 GlcNAc 2 as the primary N- 
glycan, with some GlcNAcMan 4 GlcNAc 2 
and GlcNAcMan 5 GlcNAc 2 also present (Fig. 
2C). This strain, YSH37, contains a catalytic 
domain of the Drosophila melanogaster man- 
nosidase II (7), fused to the N-terminal local- 
ization signal of the S. cerevisiae Golgi pro- 
tein Mnn2. Finally, we generated YSH44 by 
introducing a GnTII-localization peptide li- 
brary and screening for strains that are able to 
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produce K3 with uniform GlcNAc 2 Man 3 
GlcNAc 2 structures (Fig. 2D). Specifically, 
this strain contained the catalytic domain of 
GnTII from Rattus norvegicus and the leader 
peptide from S. cerevisiae protein Mnn2. As 
expected, the N-glycans from YSH44 were 
substrates for both hexosaminidase and 
p-l,4-galactosyltransferase (Fig. 2, E and F). 
Unlike the parent strain BK64-1, YSH44 no 
longer displayed flocculation behavior in 
culture and was able to produce similar 
amounts of recombinant protein when com- 
pared with the commercially available 
wild-type P. pastoris strain GSI 15 (Fig. 3). 
These strains both demonstrated a similar 
degree of occupancy on the single N-linked 
site of K3 (-100%) (Fig. 3). 

Being able to produce N-glycans of high 
purity in vivo depends on the efficiency of indi- 
vidual processing steps throughout the secretory 



pathway and the absence of local competing 
reactions for the same substrate. For example, 
YSH37 shows the consequence of producing a 
glycan intermediate in vivo that can also serve as 
a substrate for competing pathways. Although 
GlcNAcMan 3 GlcNAc 2 represented the main N- 
glycan structure produced by this strain, we 
found two additional structures with masses 
corresponding to GIcNAcMaojGlcNAc^ and 
GlcNAcMan 5 GlcNAc2. We performed de- 
tailed analyses by mass spectrometry, high- 
performance liquid chromatography, and en- 
zymatic digestions to determine the struc- 
ture of these contaminating glycans. The 
fourth and fifth mannose residues on the 
GlcNAcMan 3 GlcNAc 2 core were sensitive 
to a-l,2-mannosidase digest, which led us to 
conclude that mannosidase II conversion 
was complete and that subsequent a- 1,2- 
mannosyltransferase activity was involved. 
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Fig, 1. N-linked glycosylation pathway in humans and in P. pastoris. Mns, a-1,2-mannosidase; 
Mnsll, mannosidase II; GnTI, p-1,2-A/-acetylglucosaminyltransferase I: GnTII, p-1 ,2-N-acetylgtu- 
cosaminyltransferase II; MnT, mannosyltransferase. 
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Fig. 2. Matrix-assisted laser desorption time- 
of-flight mass spectrometry (MALDI-TOF) 
spectra of N-linked glycans released from 
recombinant K3. K3 was produced in P. pas- 
toris strains BK64 (5) [GS115 wild-type con- 
trol (Invitrogen, Carlsbad, USA)], PBP-3, 
YSH37, and YSH44 and purified from culture 
supernatants by Ni-affinity chromatography. 
The glycans were released from K3 by pep- 
tide-N-glycosidase F treatment The released 
N-linked glycans were analyzed by MALDI- 
TOF mass spectrometry (positive mode), 
typically appearing as the sodium or potassi- 
um adducts. (A) BK64, (B) PBP-3, (C) YSH37, 
(D) YSH44, (E) glycans from YSH44 after 
(3-A/-acetylhexosaminidase treatment, and 
(F) glycans from YSH44 after p-1,4-galacto- 
syltransferase treatment. To simplify labeling 
of glycoforms, the two core GlcNAc residues, 
although present in all species, have been 
omitted. M, mannose. 
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Fig. 3. Comparative expression levels of K3 in 
wild-type (BK64) (5) versus YSH44 (6) P. pas- 
tom. Analysis of the reporter protein K3 by 
SDS-polyacrylamide gel electrophoresis (SDS- 
PACE) (6). 
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Representation of Action 
Sequence Boundaries by Macaque 
Prefrontal Cortical Neurons 

Naotaka Fuji! and Ann M. Craybiel* 

Complex biological systems such as human language and the genetic code are 
characterized by explicit markers at the beginning and end of functional se- 
quences. We report here that macaque prefrontal cortical neurons exhibit 
phasic peaks of spike activity that occur at the beginning and endpoint of 
sequential oculomotor saccade performance and have the properties of dy- 
namic start- and end-state encoders accompanying responses to sequential 
actions. Sequence bounding may thus reflect a general mechanism for encoding 
biological information. 



This was further substantiated by the intro- 
duction of GnTII, which preempted the ac- 
tion of a-l,2-mannosyltransferase and led 
to the formation of uniform GlcNAc 2 
Man 3 GlcNAc 2 structures (Fig. 2, D and E). 




gal glycosylation pathway was reengineered 
to mirror the processing of human W-glycan 
structures. However, unlike the human path- 
way, which typically results in an array of 
glycoforms, this system yielded essentially 
homogeneous glycoforms. 

To better understand the relation between 
N-glycosylation and protein function, generat- 
ing uniform glycoproteins with specific Af-gly- 
can structures will prove to be of great utility 
(5). At present, the interpretation of these rela- 
tions is compromised by the heterogeneity of 
the glycoform pools that can be generated from 
mammalian sources. These pools are typically 
created by expressing glycoproteins in specific 
glycosylation mutants or by ex vivo enrich- 
ment, such as lectin chromatography or enzy- 
matic treatment Moreover, even when a partic- 
ular structure is identified, it is difficult to 
produce these structures at a commercial scale. 
Being able to engineer human glycosylation 
pathways into yeast strains able to express pro- 
teins with uniform Af-glycan structures offers a 
more practical solution to this problem. When 
single proteins from a library of genetically 
engineered yeasts can be expressed and when 
each produces a defined and uniform glyco- 
form, glycoprotein libraries can be generated to 
elucidate specific structure-function relations 
and to identify the most efficacious glycoform 
for a particular biological function. Once iden- 
tified, a particular glycoform can be readily 
produced at industrial scale because of the ra- 
pid and well-established scale-up of yeast 
fermentations. 
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Speaking, hitting a fastball, and playing the 
flute are premier examples of behaviors re- 
quiring meticulous sequencing of actions. 
These are learned skills, but the ability to 
carry out sequences of movements is funda- 
mental to most normal behaviors of humans 
and other animals (7). These sequential be- 
haviors extend over time, so that temporal 
order as well as spatial order must be regu- 
lated. Many levels of the neuraxis have been 
implicated in the control of sequential behav- 
iors (2-6). The prefrontal cortex, however, is 
a key region involved in the planning and 
automatization of the sequential behaviors 
that enable us to produce a smooth flow of 
actions or thoughts across time (7-77). 

Implicit in the structure of such sequences 
is that they have a clear beginning and end, 
and explicit designations of start and end 
states have proven useful in models of 
sequential behavior (72). To test for such 
representations of start and end states, we 
recorded from multiple neurons in the pre- 
frontal cortex and, for comparison, in the 
frontal eye field (FEF), as highly trained ma- 
caque monkeys performed up to seven se- 
quential saccades (eye movements to redirect 
the line of sight) in blocks of trials (75) (Fig. 
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1A). Successive trials had a clear beginning 
(the illumination of a central red fixation 
point), a clear movement period (in which the 
monkey made saccades to successively pre- 
sented red visual targets), and a final reward 
delivery period (in which the monkeys re- 
ceived juice or water for correct perfor- 
mance). In the standard task, different se- 
quences were presented in pseudo-random 
order within trial blocks. We analyzed neu- 
ronal spike activity in relation to successive 
task events throughout the trials and during 
the intertrial intervals (13). 

Neurons in the FEF responded selective- 
ly to saccades made in particular directions 
(14, 75), and some showed sequence selec- 
tivity. Many neurons in the prefrontal cor- 
tex exhibited similar properties, but in ad- 
dition they responded to multiple phases of 
the task and often exhibited tonic firing 
during the entire trial or during the move- 
ment period (13, 16). Figure IB illustrates 
typical prefrontal responses during the 
four-saccade task, with peaks of activity for 
each saccade during the movement period 
and a smaller peak after initiating fixation. 

Nearly half of the task-related prefrontal 
neurons (295 of 658), but only rare FEF 
neurons (13 of 1 16), had an additional phasic 
peak in firing 270 to 280 ms after the 
sequence of saccades was completed. This 
"extra" peak occurred no matter how many 
saccades were made during the previous 
movement period (Fig. 1, B, E, and F) (fig. 
SI) (73). Even for a single saccade, there 
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